SiSSA: An Infrastructure for Developing NLP Applications
Abstract
In recent years there has been growing interest in the commercial deployment of NLP technologies. This paper presents SiSSA, a project whose main aim is to develop an infrastructure for prototyping, editing and validating NLP application architectures. The system will provide the user with a graphical environment for (1) selecting the NLP activities relevant to a particular NLP task and the associated linguistic processors that execute them; (2) connecting new linguistic processors to SiSSA; and (3) checking that the chosen architectural hypothesis corresponds to the functional specifications of the given application. The proposed infrastructure relies on state-of-the-art software technologies (CORBA, XML, RDF) to integrate different linguistic processors effectively. The paper also briefly presents the definition of a metaformalism for unifying different grammar description formalisms.
Similar resources
SiSSA - An Infrastructure For NLP Application Development
Recently there has been a growing interest in infrastructures for sharing NLP tools and resources. This paper presents SiSSA, a project that aims at developing an infrastructure for prototyping, editing and validation of NLP application architectures. The system will provide the user with a graphical environment for (1) selecting the NLP activities relevant for the particular NLP task and the a...
Software Infrastructure for Natural Language Processing
We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP and Language Engineering. We describe a system called GATE (a General Architecture for Text Engineering) that provides a software infrastructure on top of which heterogeneous NLP processing modules...
Software Infrastructure for Natural Language
We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP and Language Engineering. We describe a system called GATE (a General Architecture for Text Engineering) that provides a software infrastructure on top of which heterogeneous NLP processing modules m...
Order–disorder phase boundary between ice VII and VIII obtained by first principles
Department of Geology and Geophysics, University of Minnesota, 421 Washington Ave., SE, Minneapolis, MN 55455, USA; Minnesota Supercomputing Institute and Department of Chemical Engineering and Materials Science, University of Minnesota, 421 Washington Ave., SE, Minneapolis, MN 55455, USA; Scuola Internazionale Superiore di Studi Avanzati (SISSA) and INFM DEMOCRITOS National Simulation Center, ...
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web have been released, and these data improve the performance of applications in several NLP tasks. Although language resources based on the web page unit are useful in NLP tasks and applications such as knowledge acquisition, document retrieval and document summarization, such language resources have not been released so far. In this paper, we prop...